Training and Testing Low-degree Polynomial Data Mappings via Linear SVM
نویسندگان
چکیده
Kernel techniques have long been used in SVM to handle linearly inseparable problems by transforming data to a high dimensional space, but training and testing large data sets is often time consuming. In contrast, we can efficiently train and test much larger data sets using linear SVM without kernels. In this work, we apply fast linear-SVM methods to the explicit form of polynomially mapped data and investigate implementation issues. The approach enjoys fast training and testing, but may sometimes achieve accuracy close to that of using highly nonlinear kernels. Empirical experiments show that the proposed method is useful for certain large-scale data sets. We successfully apply the proposed method to a natural language processing (NLP) application by improving the testing accuracy under some training/testing speed requirements.
منابع مشابه
Low-degree Polynomial Mapping of Data for SVM
Non-linear mapping functions have long been used in SVM to transform data into a higher dimensional space, allowing the classifier to separate linearly inseparable data. Kernel tricks are used to handle the huge number of features of the mapped data point. However, the training/testing for large data is often time consuming. Following the recent advances in training large linear SVM (i.e., SVM ...
متن کاملEvaluation of Sentinel-1 Interferometric SAR Coherence efficiency for Land Cover Mapping
In this study, the capabilities of Interferometric Synthetic Aperture Radar (InSAR) time series data and machine learning have been evaluated for land cover mapping in Iran. In this way, a time series of Sentinel-1 SAR data (including 16 SLC images with approximately 24 days time interval) from 2018 to 2020 were used for a region of Ahvaz County located in Khuzestan province. Using InSAR proces...
متن کاملAn Approximate Approach for Training Polynomial Kernel SVMs in Linear Time
Kernel methods such as support vector machines (SVMs) have attracted a great deal of popularity in the machine learning and natural language processing (NLP) communities. Polynomial kernel SVMs showed very competitive accuracy in many NLP problems, like part-of-speech tagging and chunking. However, these methods are usually too inefficient to be applied to large dataset and real time purpose. I...
متن کاملSupport vector machine classification for large datasets using decision tree and Fisher linear discriminant
The training of a support vector machine (SVM) has a time complexity between O(n) and O(n). Most training algorithms for SVM are not suitable for large data sets. Decision trees can simplify SVM training, however the classification accuracy becomes lower when there are inseparable points. This paper introduces a novel method for SVM classification. A decision tree is used to detect low entropy ...
متن کاملکاربرد ماشین بردار پشتیبان در طبقهبندی کاربری اراضی حوزه چشمه کیله- چالکرود
Classification of land use extraction always been one of the most important applications of remote sensing and why different methods are created. Over time and with greater accuracy were developed more advanced methods that increase the accuracy and the extraction classes that were closer together in terms of quality are better. SVM is one of these methods in the study of this method for the ex...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Machine Learning Research
دوره 11 شماره
صفحات -
تاریخ انتشار 2010